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Abstract 


Covariate adjustment is a widely used approach 
to estimate total causal effects from observational 
data. Several graphical criteria have been de¬ 
veloped in recent years to identify valid covari¬ 
ates for adjustment from graphical causal mod¬ 
els. These criteria can handle multiple causes, 
latent confounding, or partial knowledge of the 
causal structure; however, their diversity is con¬ 
fusing and some of them are only sufficient, but 
not necessary. In this paper, we present a cri¬ 
terion that is necessary and sufficient for four 
different classes of graphical causal models: di¬ 
rected acyclic graphs (DAGs), maximum ances¬ 
tral graphs (MAGs), completed partially directed 
acyclic graphs (GPDAGs), and partial ancestral 
graphs (PAGs). Our criterion subsumes the ex¬ 
isting ones and in this way unifies adjustment set 
construction for a large set of graph classes. 


1 INTRODUCTION 


Which covariates do we need to adjust for when estimat¬ 
ing total causal effects from observational data? Graph¬ 
ical causal modeling allows to answer this question con¬ 
structively, and contributed fundamental insights to the 
theory of adjustment in general. Por instance, a sim¬ 
ple example known as the “M-bias graph” shows that 
it is not always appropri ate to adjust for all obser ved 
(pre-treatment) covariates ( Shriei , 2008t Rubin , 2008 ). A 
few s mall graphs also suffice to refut e the “Table 2 fal¬ 
lacy” ( Westreich and Greenlandl 2013 ). which is the be¬ 
lief that the coefficients in multiple regression models are 
“mutually adjusted”. Thus, causal graphs had substan- 
tial impact on theory a nd practice of covariate adjustment 
( Shrier and Platt . 2008 ). 


The practical importance of covariate adjustment has in¬ 
spired a growing body of theoretical work on graphi¬ 


cal criteria that are sufficient and/or n ecessary for ad- 
justment. Pearl’s back-door criterion (Pearl 1993 ) is 
probably the most well-known, and is sufficient but 


not ne cessary for adjustment in DAGs. IShpitser et al. 


( 2010l) adapted the back-door criterion to a neces¬ 
sary and sufficient graphical criterion for adjustment 
in DAGs. Others considered graph classes other 
than DAGs, which can re present structural uncertainty. 


van der Zander et alJ (120 14^ gave necessary and sufficient 


graphical criteria for MAGs tha t allow for unobserved 
variab les (latent confounding). Maathuis and Colombol 
(12015h presented a generalized back-door criterion 
for DAGs, MAGs, GPDAGs and PAGs, where 
GPDAGs and PAGs represent Markov equivalence classes 


ferred directly from data (see, e.g.. Snirtes et al.. 

200C; 

Chickering. 

2003; 

Colombo et al.. 2012n Claassen et al.. 

20131: Colombo and Maathuisl 20141). The generalized 


back-door criterion is sufficient but not necessary for ad¬ 
justment. 

In this paper, we exte n d the r esults of Shpitser et al.l(l201C ), 
van ^r Zander et ^ ( 2014 ) and Maathuis and Colombol 


( 2015h to derive a single necessary and sufficient adjust¬ 
ment criterion that holds for all four graph classes: DAGs, 
GPDAGs, MAGs and PAGs. 


To illustrate the use of our generalized adjustment crite¬ 
rion, suppose we are given the GPDAG in Pigure[Tt and 
we want to estimate the total causal effect of X on Y. Our 
criterion will inform us that the set {A, Z} is an adjustment 
set for this GPDAG, which means that it is an adjustment 
set in every DAG that the GPDAG represents (FigurefTt). 
Hence, we can estimate the causal effect without knowl¬ 
edge of the full causal structure. In a similar manner, by 
applying our criterion to a MAG or a PAG, we find ad¬ 
justment sets that are valid for all DAGs represented by 
this MAG or PAG. Our criterion finds such adjustment 
sets whenever they exist; else, our knowledge of the model 
structure is insufficient to compute the desired causal ef¬ 
fect by covariate adjustment. We hope that this ability to 
allow for incomplete structural knowledge or latent con- 
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Figure 1: (a) A CPDAG in which, according to our cri¬ 
terion, {A, Z} is an adjustment set for the total causal ef¬ 
fect of X on Y. (b) The Markov equivalence class of (a), 
with node labels removed for simplicity and varying edges 
highlighted. An adjustment set for a CPDAG (PAG) is 
one that works in all DAGs (MAGs) of the Markov equiv¬ 
alence class. 


founding or both will help address concerns that graphical 
causal modelling “ assumes that all 1...] D AGs have been 
properly specified” (IWest and Kochi l2014t) . 


We note that, although we can find all causal effects that are 
identifiable by covariate adjustment, we generally do not 
find all identifiable causal effects, since some effects may 
be identif iable by othe r means, such as Pearl’s front-door 


criterion (iPearl. 


(ITian and Pearl. 


2005, Section 3.3.2) or t he ID algorithm 


2002 : Shpitser and Pearl . 2006h . We also 


point out that MAGs and PAGs are in principle not only 
able to represent unobserved confounding, but can also ac¬ 
count for unobserved selection variables. However, in this 
paper we assume that there are no unobserved selection 
variables. This restriction is mainly due to the fact that 
selection bias often renders it impossible to identify causal 


effects using just covariate adjustment. iBareinboim et al. 


(1201411 discuss these problems and present creative ap¬ 
proaches to work around them, e.g., by combining data 
from different sources. We leave the question whether ad¬ 
justment could be combined with such auxiliary methods 
aside for future research. 


2 PRELIMINARIES 

Throughout the paper we denote sets in bold uppercase let¬ 
ters (e.g., S), graphs in calligraphic font (e.g., Q) and nodes 
in a graph in uppercase letters (e.g., X) 

Nodes and edges. A graph Q = (V, E) consists of a set of 
nodes (variables) V = {Xi, ..., Xp] and a set of edges E. 

There is at most one edge between any pair of nodes, and 
nodes are called adjacent if they are connected by an edge. 
Every edge has two edge marks that can be arrowheads, 
tails or circles. Edges can be directed bidirected o, 
non-directed o-<^ or partially directed We use • as a 


stand in for any of the allowed edge marks. An edge is 
into (out of) a node X if the edge has an arrowhead (tail) 
at X. A directed graph contains only directed edges. A 
mixed graph may contain directed and bi-directed edges. 
A partial mixed graph may contain any of the described 
edges. Unless stated otherwise, all definitions apply for 
partial mixed graphs. 

Paths. A path p from X to U in (y is a sequence of distinct 
nodes {X, ..., E) in which every pair of successive nodes 
is adjacent in Q. A node V lies on a path p \f V occurs 
in the sequence of nodes. The length of a path equals the 
number of edges on the path. A directed path from X to 
y is a path from X to E in which all edges are directed 
towards E, i.e., X —• • • —E. A directed path is also 
called a causal path. A possibly directed path (possibly 
causal path) from X to E is a path from X to E that has 
no arrowhead pointing to X. A path from X to E that is 
not possibly causal is called a non-causal path from X to 
E. A directed path from X to E together with an edge 
E —>• X (E •(-> X) forms an (almost) directed cycle. For 
two disjoint subsets X and Y of V, a path from X to Y is 
a path from some X € X to some E € Y. A path from X 
to Y is proper if only its first node is in X. 

Subsequences and subpaths. A subsequence of a path p is 
a sequence of nodes obtained by deleting some nodes from 
p without changing the order of the remaining nodes. A 
subsequence of a path is not necessarily a path. For a path 
p = (Xi,X 2 ,..., Xm), the subpath from Xi to Xk (1 < 
i<k <m) is thepathp(Xi,Xfe) = (Xj, X^+i,..., X^). 
We denote the concatenation of paths by 0, so that for ex¬ 
ample p = p{Xi,Xk) © p{Xk, Xm)- We use the conven¬ 
tion that we remove any loops that may occur due to the 
concatenation, so that the result is again a path. 

Ancestral relationships. If X Y, then X is a par¬ 
ent of E. If there is a (possibly) directed path from X to 
E, then X is a (possible) ancestor of E, and E is a (pos¬ 
sible) descendant of X. Every node is also a descendant 
and an ancestor of itself. The sets of parents and (possi¬ 
ble) descendants of X in ^ are denoted by Pa(X, Q) and 
(Poss)De(X, (y) respectively. For a set of nodes X C V, 
we have Pa(X, Cy) = Uxex Pa(X, Cy), with analogous 
definitions for (Poss)De(X, Q). 

Colliders and shields. If a path p contains 
Xi»-^Xj<-uXk as a subpath, then Xj is a collider 
on p. A collider path is a path on which every non¬ 
endpoint node is a collider. A path of length one is a trivial 
collider path. A path (Xi, Xj, Xk) is an (un)shielded triple 
if Xi and Xk are (not) adjacent. A path is unshielded if all 
successive triples on the path are unshielded. Otherwise 
the path is shielded. A node Xj is a definite non-collider 
on a path p if there is at least one edge out of Xj on p, or if 
Xi*-oXjO-*Xfc is a subpath of p and {Xi,Xj,Xk) is an 
unshielded triple. A node is of definite status on a path if it 
































is a collider or a definite non-collider on the path. A path 
p is of definite status if every non-endpoint node on p is 
of definite status. An unshielded path is always of definite 
status, but a definite status path is not always unshielded. 

m-separation and m-connection. A definite status path 
p between nodes X and Y is m-connecting given a set of 
nodes Z (X, Y ^ Z) if every definite non-collider on p is 
not in Z, and every collider on p has a descendant in Z. 
Otherwise Z blocks p. If Z blocks all definite status paths 
between X and Y, we say that X and Y are m-separated 
given Z. Otherwise, X and Y are m-connected given Z. 
For pairwise disjoint subsets X, Y, Z of V, X and Y are 
m-separated given Z if X and Y are m-separated by Z for 
any X G X and Y G Y. Otherwise, X and Y are m- 
connected given Z. 


Causal Bayesian networks. A directed graph without 
directed cycles is a directed acyclic graph fDAG). A 
Bayesian network for a set of variables V = {Xi,..., Xp} 
is a pair {Q, /), where Q is a DAG, and / is a joint 
probability density for V that factorizes according to the 
conditional independence relationships described via m- 
separ ation. that is /(V) = Hti fi^i\Po,{Xi, Q)) jPearl . 


20091) . We call a DAG causal when every edge Xi Xj 


in Q represents a direct causal effect of Xi on Xj. A 
Bayesian network (Q, f) is a causal Bayesian network if 
Q is a causal DAG. If a causal Bayesian network is given 
and all variables are observed one can easily derive post¬ 
intervention densities. In particular, we consider interven¬ 
tions do(X = x) (X C V), which re present outside inter¬ 
ventions that set X to X (Pear], 20091) : 


f{v\do{X = x)) = 

( Oxjevxx fiti:i\Pa{xi, Q)), 

[o, 


for values of V 
consistent with x, 
otherwise. 

( 1 ) 


Equat ion (1) is kn own as the trunc ated factorizat ion for¬ 
mula ( Pearl . 20091) or the g-formula ( Robinsl 1986 ). 


Maximal ancestral graph. A mixed graph Q without di¬ 
rected cycles and almost directed cycles is called ances¬ 
tral. A maximal ancestral graph fMAGj is an ances¬ 
tral graph Q = (V, E) where every two non-adjacent 
nodes X and Y in Q can be m-separated by a set Z C 
V \ {X,Y}. A DAG with unobserved variables can be 
uniquely represented by a MAG that preserves the an¬ 
cestral an d m-separation relationships am ong the observed 
variables ( Richardson and Spirtes . 2002). The MAG of a 
causal DAG is a causal MAG. 


Markov equivalence. Several DAGs can encode the same 
conditional independence information via m-separation. 
Such DAGs form a Markov equivalence class which can 
be described uniquely by a completed partially directed 



Figure 2: Two configurations where the edge X —^ Y is 
visible. 


acyclic graph fCPDAGj . Several MAGs can also en¬ 
code the same conditional independence information. Such 
MAGs form a Markov equivalence class which can be 
described uniquely by a partial ancestral gra ph fPAGj 
( Richardson and Spirtes , 2002t Ali et al.L 12009 ). We de¬ 
note all DAGs (MAGs) in the Markov equivalence class 
described by a CPDAG (PAG) Q by [Q]. 


Consistent density. A density / is consistent with a causal 
DAG V if the pair (T>, /) forms a causal Bayesian network. 
A density / is consistent with a causal MAG Ai if there ex¬ 
ists a causal Bayesian network {V, /'), such that Ai rep¬ 
resents V and / is the observed marginal of /'. A density 
/ is consistent with a GPDAG (PAG) Q if it is consistent 
with a DAG (MAG) in [(/]. 


Visible and invisible edges. All directed edges in DAGs 
and GPDAGs are said to be visible. Given a MAG Ad or a 
PAG G, a directed edge X —> Y is visible if there is a node 
V not adjacent to Y such that there is an edge between V 
and X that is into X, or if there is a collider path from V to 
X that is into X and every non-endpoint node on the path 
is a parent of Y. Otherwise, X —Y is said to be invisible 
( Zhang , 20061: Maathuis and Colombol 2015 ). 


A directed visible edge X —Y means that there are no 
latent confounders between X and Y. 


3 MAIN RESULT 

Throughout, let G = (V, E) represent a DAG, GPDAG, 
MAG or PAG, and let X, Y and Z be pairwise disjoint 
subsets of V, with X 7 ^ 0 and Y 7 ^ 0. Here X represents 
the intervention variables and Y represents the set of re¬ 
sponse variables, i.e., we are interested in the causal effect 
ofXon Y. 

We define sound and complete graphical conditions for ad¬ 
justment sets relative to (X, Y) in G- Thus, if a set Z satis¬ 
fies our conditions relative to (X, Y) in G (Definition 13.31 1. 
then it is a valid adjustment set for calculating the causal 
effect of X on Y (Definition [3T]l, and every existing valid 
adjustment set satisfies our conditions (see Theorem l3.41 l. 
First, we define what we mean by an adjustment set. 


































Defin ition 3.1. (Adjustment set; Maathuis and Colombd 
2015 ) Let Q represent a DAG, CPDAG, MAG or 
PAG. Then Z is an adjustment set relative to fX,Yj 
in Q if for any density f consistent with Q we have 

/(y|<io(x)) = |{(^W rZ = 0, 

=-Ez{/(y|z,x)} otherwise. 

Ifyi = {X} and'Y = {Y}, we call Z an adjustment set 
relative to (A, Y) in the given graph. 


To define our generalized adjustment criterion, we intro¬ 
duce the concept of amenability: 

Definition 3.2. (Amenability for DAGs, CPDAGs, 
MAGs and PAGs)A DAG, GPDAG, MAG or PAG G is 
said to be adjustment amenable, relative to (X, Y) if every 
possibly directed proper path from X fo Y in Q starts with 
a visible edge out ofK.. 


For conciseness, we will also write “amenable” instead of 
“adjustment amenable”. The intuition behind the concept 
of amenability is the following. In MAGs and PAGs, di¬ 
rected edges A —^ Y can represent causal effects, but 
also mixtures of causal effects and latent confounding; in 
GPDAGs and PAGs, there are edges with unknown di¬ 
rection. This complicates adjustment because paths con¬ 
taining such edges can correspond to causal paths in some 
represented DAGs and to non-causal paths in others. For 
instance, when the graph A ^ Y is interpreted as a DAG, 
the empty set is a valid adjustment set with respect to 
(A, Y) because there is only one path from A to Y, which 
is causal. When the same graph is however interpreted as a 
MAG, it can still represent the DAG A ^ Y, but also 
for example the DAG A ^ Y with a non-causal path 
A ^ L ^ Y where L is latent. A similar problem arises 
in the GPDAG Ao^Y. 
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Figure 3: (a) PAG P, (b) MAG Mi, (c) MAG M 2 used 
in Example l4.2l 


Note that condition (0) does not depend on Z. In other 
words, if condition (0) is violated, then there is no set 
Z' C V \ (X U Y) that satisfies the generalized adjust¬ 
ment criterion relative to (X, Y) in Q. 

Condition (1) defines a set of nodes that cannot be used in 
an adjustment set. Denoting this set of forbidden nodes by 

Fp(X, Y) = {lY' G V : lY' G PossDe(lY, G) for some 
lY ^ X which lies on a proper possibly 
causal path from X to Y}, (2) 

condition (1) can be stated as; ZnFp(X, Y) = 0. We will 
sometimes use this notation in examples and proofs. 

We now give the main theorem of this paper. 

Theorem 3.4. Let G represent a DAG, GPDAG, MAG or 
PAG. Then Z is an adjustment set relative to (X, Y) in G 
(Definition \3.R if and only ifZ satisfies the generalized ad¬ 
justment criterion relative to (X, Y) in G (Definition \3.3\) . 


4 EXAMPLES 


We will show that for a graph G that is not amenable rela¬ 
tive to (X, Y), there is no adjustment set relative to (X, Y) 
in the sense of Definition 13.II (see Lemma l5^ . Note that 
every DAG is amenable, since all edges in a DAG are vis¬ 
ible and directed. For MA Gs, our notion of amenabilit y 
reduces to the one defined by van der Zander et al. ( 20141) . 


We now introduce our Generalized Adjustment Criterion 
(GAC) for DAGs, CPDAGs, MAGs and PAGs. 
Definition 3.3. (Generalized Adjustment Criterion 

(GAC)) Let G represent a DAG, GPDAG, MAG or PAG. 
Then Z satisfies the generalized adjustment criterion 
relative to (X, Y) in G if the following three conditions 
hold: 


(0) G is adjustment amenable relative to fX, Y ), and 

(1) no element in Z is a possible descendant in G of any 
lY G V \ X which lies on a proper possibly causal 
path from X to Y, and 

(2) all proper definite status non-causal paths in G from 
X fo Y are blocked by Z. 


We now provide some examples that illustrate how the gen¬ 
eralized adjustment criterion can be applied. 

Example 4.1. We first return to the example of the Intro¬ 
duction. Consider the GPDAG C in Figure\Jji. Note that C 
is amenable relative to (X,Y) and that Fc(A, Y) = {Y}. 
Hence, any node other than X and Y can be used in an 
adjustment set. Note that every definite status non-causal 
path p from X to Y has one of the following paths as a 
subsequence: pi = {X,Z,Y) and P 2 = (X, A, B,Y), 
and nodes on p that are not on pi or p 2 are non-colliders 
on p. Hence, if we block pi and p 2 , then we block all defi¬ 
nite status non-causal paths from X to Y. This implies that 
any superset of {Z,A} and {Z,B} is an adjustment set 
relative to (A, Y) in C, and all adjustment sets are given 
by: {Z, A}, {Z, B}, {Z, A, /}, {Z, B, /}, {Z, A, B} and 
{Z,A,B,I}. 

Example 4.2. To illustrate the concept of amenability, con¬ 
sider Figure\^with a PAG V in (a), and two MAGi Mi 
and M 2 in \P] in (b) and (c). Note that V and Mi are 
not amenable relative to (A, Y). For V this is due to the 
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Figure 4: (a) PAG Pi, (b) PAG P 2 used in Example 1431 
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Figure 5: (a) GPDAG C, (b) PAG P used in Example l4Al 


path Xo-oY, and for Mi this is due to the invisible edge 
X ^ Y. On the other hand, M 2 is amenable relative to 
{X, Y), since the edges X ^ Y and X ^ V 2 are visible 
due to the edge Vi ^ X and the fact that Vi is not adja¬ 
cent to Y or ¥ 2 - Since there are no proper definite status 
non-causal paths from X to Y in M 2 , it follows that the 
empty set satisfies the generalized adjustment criterion rel¬ 
ative to (X, Y) in M 2 - Finally, note that Mi could also be 
interpreted as a DAG. In that case it would be amenable 
relative to {X,Y). This shows that amenability depends 
crucially on the interpretation of the graph. 

Example 4.3. Let Vi and V 2 be the PAGi in Figure^a) 
and Figure^b), respectively. Both PAGi are amenable 
relative to {X,Y). We will show that there is an adjust¬ 
ment set relative to (X, Y) in Vi but not in P 2 . This illus¬ 
trates that amenability is not a sufficient criterion for the 
existence of an adjustment set. 

We first consider Vi. Note that F-pfX,Y) = {V/i,Y} is 
the set of nodes that cannot be used for adjustment. There 
are two proper definite status non-causal paths from X to 
Y in Pi.- ^ Y and X ^ Vi ^ V 3 ^ Y. 

These are blocked by any set containing V3. Hence, all 
sets satisfying the GAC relative to (X, Y) in Vi are: {V3}, 
{ViMs}, {V 2 , V 3 } and {Vi,V 2 , V 3 }. 

We now consider P 2 . Note that F-p^iX^Y) = 
F-pj {X, Y) = {Vi,Y}. There are three proper definite sta¬ 
tus non-causal paths from XtoY in P 2 .- pi = A o V 3 — 
Y, P 2 — X -O' V 3 ■O' V 4 —y Y and p^ = X —y V 4 ■o 
V 3 O Y. To block pi, we must also use V 3 . This implies 
that we must use Vi to block p 2 . But V 4 G Fp,^{X,Y). 
Hence, there is no set Z that satisfies the GAC relative to 
(X,Y) in P 2 . 


Example 4.4. Let X = {Ai, A 2 } and Y = {A} and 
consider the GPDAG C and the PAG P in Figures^a) 
and \^b). We will show that for both graphs there is 
no set that satisfies the generaU zed back-door criterion 
of \Maathuis and Colombo 1 2015 ) relative to (X, Y), but 
there are sets that satisfy the generalized adjustment crite¬ 
rion relative to (X, Y) in these graphs. 


Recall that a set Z satisfies the generalized back-door cri¬ 
terion relative to (X, Y) and a GPDAG fPAGj Q if 7i 
contains no possible descendants ofX. in Q and if for every 
A € X the set Z U X \ {A} blocks every definite status 
path from X to every Y G Y in Q that does not start with 
a visible edge out of X. 


We first consider the GPDAG C. To block the path X 2 o 
V 2 Y, we must use node V 2 , but V 2 G PossDe(Ai, C). 
Hence, no set Z can satisfy the generalized back-door cri¬ 
terion relative to (X, Y) in C. However, {Vi, V 2 } satisfies 
the generalized adjustment criterion relative to (X, Y) in 
C. 


We now consider P. To block the path X2 ^ V2 Y, 
we must use node V 2 . But, V 2 G De(Ai,P) and thus 
there is no set satisfying the generalized back-door cri¬ 
terion relative to (X,Y) in P. However, sets {Vi,V 2 }, 
{Vi,V2,V3}, {Vi,V2,Vi}, {Vi,V2,V3, Vi} all satisfy the 
generalized adjustment criterion relative to (X, Y) in P. 


5 PROOF OF THEOREM |33] 

For DAGs and MAGs, our generalized adjustment crite¬ 
rion reduces to the following adjustment criterion: 

Definition 5.1. (Adjustment Criterion (AC)) Let Q = 

(V,E) represent a DAG or MAG . Then Z satisfies the 
adjustment criterion relative to (X, Y) in Q if the follow¬ 
ing three conditions hold: 

(0*) Q is adjustment amenable with respect to (X, Y), and 

(a) no element in Z is a descendant in Q of any W G 
V \ X which lies on a proper causal path from X to 
Y, and 

(b) all proper non-causal paths in Q from X fo Y are 
blocked by Z. 


This adjustment criterion is a slightly reformu¬ 
lated but equivalent ve rs ion o f the adjustment cri- 
te rion of Shnitseret^ (l2010f) for DAGs and of 


van der Zander et al.l (120141) for MAGs, with amenabil¬ 
ity directly included in the criterion. This adjustment 
criterio n was shown to b e sound and c omplete for 
DAGs (Shpitser et al. . 20ld Shpitsei , 2012 ) and MAGs 
( van der Zander et al.[ l2014lf We therefore only need to 
prove Theorem l3.4l for GPDAGs and PAGs. 










































Lemma l5^ 


To this end, we need three main lemmas, given below. 
Throughout, we let Q = (V, E) represent a CPDAG or 
a PAG, and we let X, Y and Z be pairwise disjoint subsets 
of V, with X ^ 0 and Y 7 ^ 0. We use GAG and AC to re¬ 
fer to the generalized adjustment criterion (Definition 13.31 l 
and adjustment criterion (Dehnition l5.11 l. respectively. 

Lemma l5^ is about condition (0) of the GAG; 

Lemma 5.2. If a CPDAG (PAGj Q satisfies condition (0) 
of the GAC relative to (X,Y), then every DAG (MAGj 
in [Q] satisfies condition (0*) of the AC relative to (X, Y). 
On the other hand, if Q violates condition (0) of the GAC 
relative to (X, Y), then there exists no set Z' C V \ (X U 
Y) that is an adjustment set relative to (X, Y) in Q (see 
Definition \3.1\l . 

Next, we assume that Q satisfies condition (0) of the GAC 
relative to (X, Y). Under this assumption, we show that 
Z satisfies conditions (1) and (2) of the GAC relative to 
(X, Y) in ty if and only if Z satisfies conditions (a) and (b) 
of the AC relative to (X, Y) in every DAG (MAG) in [Q], 
This is shown in two separate lemmas: 

Lemma 5.3. Let condition (0) of the GAC be satisfied rela¬ 
tive to (X, Y) in a CPDAG (PAG) Q. Then the following 
two statements are equivalent: 

• Z satisfies condition (1) of the GAC relative to (X, Y) 
in Q. 

• Z satisfies condition (a) of the AC relative to (X, Y) 
in every DAG (MAG) in [Q], 

Lemma 5.4. Let condition (0) of the GAC be satisfied rel¬ 
ative to (X, Y) in a CPDAG (PAG) G, and let Z satisfy 
condition (1) of the GAC relative to (X, Y) in Q. Then the 
following two statements are equivalent: 


Lemma 15.51 

i 

Lemma 1531 - *■ Theorem l3.4l 

t 

Lemma 15)71 - >- Lemma 1578 ] - Lemma 157 ^ 

Figure 6 : Proof structure of Theorem l3.4l 

To prove the other direction, suppose that the tuple G, X, 
Y, Z does not satisfy all conditions of the GAC. First, sup¬ 
pose that G violates condition (0) relative to (X, Y). Then 
by Lemma lST^ there is no adjustment set relative to (X, Y) 
in G, and hence Z is certainly not an adjustment set. 

Otherwise, Z must violate condition (1) or (2) of the GAC 
relative to (X, Y). By applying Lemmas 15.31 and [5741 in 
turn, this implies that there is a DAG V (MAG A4) in [G] 
such that Z violates conditions (a) or (b) of the AC relative 
to (X, Y) in V (M). Since the AC is sound and complete 
for DAGs and MAGs, this implies that Z is not an adjust¬ 
ment set relative to (X, Y) in V (AA), so that Z is certainly 
not an adjustment set relative to (X, Y) in G- 

□ 

5.1 PROOF OF LEMMA 1572] 

The proof of Lemma 5.2 is based on the following lemma: 

Lemma 5.5. Let X and Y be nodes in a PAG V, such that 
there is a possibly directed path p* from X to Y in V that 
does not start with a visible edge out of X. Then there is 
a MAG A4 in \P\ such that the path p in AA, consisting of 
the same sequence of nodes as p* in V, contains a subse¬ 
quence that is a directed path from XtoY starting with an 
invisible edge in A4. 


• Z satisfies condition (2) of the GAC relative to (X, Y) 
in G- 

• Z satisfies condition (b) of the AC relative to (X, Y) 
in every DAG (MAG) in [G]. 


The proofs of Lemmas 15.2115.31 and 15.41 are discussed in 
Sections I57ni53l and 15.31 respectively. Some proofs re¬ 
quire additional lemmas that can be found in the supple¬ 
ment. The proof of Lemma 15.41 is the most technical, and 
builds on the work of Zhan 


a 15.41 is th 

^ (l2006h . 


The proof of Lemma lSTSl is given in the supplement. 

Proof of Lemma [5721 First suppose that G satishes condi¬ 
tion (0) of the GAC relative to (X, Y), meaning that every 
proper possibly directed path from X to Y in 0 starts with 
a visible edge out of X. Any visible edge in G is visible 
in all DAGs (MAGs) in [G], and any proper directed path 
in a DAG (MAG) in [G] corresponds to a proper possibly 
directed path in G- Hence, any proper directed path from X 
to Y in any DAG (MAG) in [0] starts with a visible edge 
out of X. This shows that all DAGs (MAGs) in [G] satisfy 
condition (0*) of the AC relative to (X, Y). 


Figure | 6 ] shows how all lemmas fit together to prove Theo¬ 
rem [374] 

Proof of Theorem 13.41 First, suppose that the CPDAG 
(PAG) G and the sets X, Y and Z satisfy all conditions of 
the GAC. By applying Lemmas l5.2115.31 and 15.41 in turn, it 
directly follows that all conditions of the AC are satisfied 
by X, Y and Z and any DAG (MAG) in [0]. 


Next, suppose that G violates condition (0) of the GAC rel¬ 
ative to (X, Y). We will show that this implies that there is 
no set Z' C V \ (X U Y) that is an adjustment set relative 
to (X, Y) in G- We give separate proofs for CPDAGs and 
PAGs. 

Thus, let G represent a CPDAG and suppose that there is 
a proper possibly directed path p from a node X C X to a 











node y G Y that starts with a non-directed edge (o-°). 

Let p' — {X, Vi,... ,Y) (where Vi = Y is allowed) be 
a shortest subsequence of p such that p' is also a proper 
possibly directed path from Y to Y starting with a non- 
directed edge in Q. We hrst show that p' is a dehnite status 
path, by contradiction. Thus, suppose that p' is not a deh¬ 
nite status path. Then the length of p' is at least 2, and we 
write p' = {X, Vi^... ,Vk = Y) for k > 2. Since the sub¬ 
path p'{Vi, Y) is a dehnite status path (otherwise we can 
choose a shorter path), this means that Vi is not of a deh¬ 
nite status on p'. This implies the existence of an edge be¬ 
tween X and V 2 . This edge must be of the form X V 2 , 
since X 0-0 V 2 implies that we can choose a shorter path, 
and X ^ V 2 t ogethe r with Xq-qV] implies Vi ■(— V 2 by 
Lemma 1 from Meek ( 1995h (see Section [T] of the supple¬ 
ment), so that p' is not possibly directed from X to Y. But 
the edge X V 2 implies t hat Vi —>■ V 2 , since otherwise 
Lemma 1 from Meek ( 1995h implies X ^ Vi. But then Vi 
is a dehnite non-collider on p', which contradicts that Vi is 
not of dehnite status. 

Hence, p' is a proper possibly dire cted dehnite status path 
from X to Y. By Lemma 7.6 from lMaathuis and Colombo 
( 2015h (see Section [T] of the supplement), there is a DAG 
Vi in [Q] such that there are no additional arrowheads into 
X, as well as a DAG V 2 in [Q] such that there are no addi¬ 
tional arrowheads into Vi. This means that the paths corre¬ 
sponding to p' are oriented as = X ^ Yi ■ • ■ —> Y 
and p '2 = X Vi ^ ^ Y inVi and 772- An adjust¬ 

ment set relative to (X, Y) in 772 must block the non-causal 
path P 2 , by using at least one of the non-endpoints nodes on 
this path. But all these nodes are in Y) (see (|2|). 

Hence, there is no set Z' C V \ (X U Y) that satishes the 
AC relative to (X, Y) in 77^ and 772 simultaneously. Since 
the AC is sound and complete for DAGs, this implies that 
there is no Z' C V \ (X U Y) that is an adjustment set 
relative to (X, Y) in Q. 

Finally, let Q represent a PAG and suppose that there is a 
proper possibly directed path p from some X G X to some 
Y G Y that does not start with a visible edge out of X in 

By Lemma 15.51 there is a subsequence p' of p such that 
there is a MAG M in [(?] where the corresponding path 
is directed from X to Y and starts with an invisible edge. 
The n Xi is not amenable relative to (X,Y). ByLemma5.7 
from ivan der Zander et alJ (120141) (see Section[T]of the sup¬ 
plement) this means that there is no set Z' C V \ (X U Y) 
that is an adjustment set relative to (X, Y) in M. Hence, 
there is no set Z' C V \ (X U Y) that is an adjustment set 
relative to (X, Y) in Q. □ 


5.2 PROOF OF LEMMA 1531 

Proof of Lemma I5.3t First, suppose that Z satishes con¬ 
dition (1) of the GAC relative to (X,Y) in Q. Then 
ZnFg(X,Y) = 0. Since Fi,(X,Y) C Fg(X,Y) 
(FAr(X, Y) C Fp(X, Y)) for any DAG 77 (MAG M) 
in [(/], it follows directly that Z satishes condition (a) of 
the AC relative to (X, Y) in all DAGs (MAGs) in [Q]. 

To prove the other direction, suppose that Q satishes con¬ 
dition (0) of the GAC relative to (X, Y), but that Z does 
not satisfy condition (1) of the GAC relative to (X, Y) in Q. 
Then there is a node Y G ZnFp(X, Y), i.e., Y G Z and Y 
is a possible descendant of a node lY on a proper possibly 
directed path from some X G X to some Y G Y in (7. We 
denote this path hy p = (X, Yi,..., Yfc, Y), where fc > 1 
and lY G {Yi,..., Yc}. Then the subpaths q — p(X, lY) 
and r = p{W, Y) are also proper possibly directed paths. 
Moreover, there is a possibly directed path s from lY to Y, 
where this path is allowed to be of zero length (if lY = Y). 
We will show that the existence of these paths implies that 
there is a DAG 77 (MAG Af) in [Q] such that Z violates 
condition (a) of the AC relative to (X, Y) in 77 (At). 

By Lemma B.l from Zhang ( 2008h (see Section [T] of the 
supplement), there are subsequences q', r' and s' of q, r and 
s that are unshielded proper possibly directed paths (again 
s' is allowed to be a path of zero length). Moreover, q' 
must start with a directed (visible) edge, since otherwise the 
concatenated path q' © r', which is again a proper possibly 
directed path from X to Y, would violate condition (0) of 
the GAC. 


Lemma B.l from IZhangi (l2008h then implies that q' is a 
directed path from X to lY in Q. Hence, the path corre¬ 
sponding to q' is a directed path from X to lY in any DAG 
(MAG) in [0]. 

By Lemma 7.6 from Maathuis and Colombol ( 2015 ). there 
is at least one DAG 77 (MAG Af) in [Q] that has no addi¬ 
tional arrowheads into lY. In this graph 77 (Af), the path 
corresponding to r' is a directed path from lY to Y, and the 
path corresponding to s' is a directed path lY to Y. Hence, 
Y G FpjX, Y) (Y G F^(X, Y)), so that Z does not sat¬ 
isfy condition (a) of the AC relative to (X, Y) in 77 (Al). 
□ 

5.3 PROOF OF LEMMA ISAl 

We hrst dehne a distance between a path and a set in Def¬ 
inition 15.61 We then give the proof of Lemma 15.41 This 
proof relies on Lemma lSAl and Lemma lSTSl which are given 
later in this section. 


Definition 5.6. (Distance-from-Z: \Zhana \200w Given a 
path pfrom XtoY that is m-connecting given Z in a DAG 
or MAG, for every collider Q on p, there is a directed path 
(possibly of zero length) from Q to a member ofZ. Define 
the distance-from-Z of Q to be the length of a shortest 






































directed path (possibly of length O)from Q to Z, and define 
the distance-from-Z of p to be the sum of the distances 
from Z of the colliders on p. 


Proof of Lemma I5.4t Let Q represent an amenable 
CPDAG (PAG) that satisfies condition (0) of the GAC rel¬ 
ative to (X, Y), and let Z satisfy condition (1) of the GAC 
relative to (X, Y) in Q. 

We first prove that if Z does not satisfy condition (2) of 
the GAC relative to (X, Y) in Q, then Z does not satisfy 
condition (b) of the AC relative to (X,Y) in any DAG 
(MAG) in [Q], Thus, assume that there is a proper definite 
status non-causal path p from A G X to Y G Y that is 
m-connecting given Z in Q. Consider any DAG V (MAG 
A4} in [Q], Then the path corresponding to p in (Ai) is 
a proper non-causal m-connecting path from X to Y given 
Z. Hence, Z violates condition (b) of the AC relative to 
(X,Y) andV(M). 

Next, we prove that if Z violates condition (b) of the AC 
relative to (X, Y) in some DAG (MAG) in [Q\, then Z 
violates condition (2) of the GAC relative to (X, Y) in Q. 
Thus, assume that there is a DAG V (MAG M) in [Q] such 
that there is a proper non-causal m-connecting path from X 
to Y in (Ai) given Z. We choose a shortest such path p, 
such that no equally short proper non-causal m-connecting 
path has a shorter distance-from-Z than p. By Lemma lS^ 
below, the corresponding path p* in Q is an m-connecting 
proper definite status non-causal path from X to Y given 
Z. Hence Z violates condition (b) of the GAC relative to 
(X,Y)ina □ 

Lemma 5.7. Let Ai represent a MAG (DAG) and let V 
be the PAG (CPDAG) of Ai. Let V satisfy condition (0) 
of the GAC relative to (X, Y), and let Z satisfy condition 
(1) of the GAC relative to (X, Y) in V. Let p be a shortest 
proper non-causal path from "K to'Y that is m-connecting 
given Z in A4 and let p* denote the corresponding path 
constituted by the same sequence of variables in V. Then 
p* is a proper definite status non-causal path in V. 


Lemma|A7]is related to Lemma 1 from Zhan3 ( 20061) . The 
proof of Lemma l5^ is given in the supplement. 


Lemma 5.8. Let Ai represent a MAG (DAG) and let V 
be the PAG (CPDAG) of Ai. Let V satisfy condition (0) 
of the GAC relative to (X, Y), and let Z satisfy condition 
(1) of the GAC relative to (X, Y) in V. Let p be a shortest 
proper non-causal path from "K. to Y that is m-connecting 
given Z in Ai, such that no equally short such path has 
a shorter distance-from-Z than p. Let p* denote the cor¬ 
responding path constituted by the same sequence of vari¬ 
ables in V. Then p* is a proper definite status non-causal 
path from X fo Y that is m-connecting given Z in V. 


Lemma lS^ is is 


related to Lemma 2 from Zhang ( 2006h . 


Proof of Lemma 15.81 By Lemma [ATl P* is a proper defi¬ 


nite status non-causal path in V. It is only left to prove that 
p* is m-connecting given Z in V. 

Every definite non-collider on p* in V corresponds to a 
non-collider on p in AI, and every collider on p* is also 
a collider on p. Since p is m-connecting given Z, no non¬ 
collider is in Z and every collider has a descendant in Z. 
Let Q be an arbitrary collider (if there is one). Then there 
is a directed path (possibly of zero length) from Q to a node 
in Z in A^. Let c? be a shortest such path from Q to a node 
Z G Z. Let d* denote the corresponding path in V, consti¬ 
tuted by the same sequence of variables. Then d* is an un¬ 
shielded possibly direc ted path from Q to Z in V (Lemma 
B.l from lzhang|(l2008h ). 


It is only left to show that d* is a directed path. If d* is 
of zero length, this is trivially true. Otherwise, suppose for 
contradiction that there is a circle mark on d*. Then d* 
must start w ith a circle mark at Q (cf. Lemma B.2 from 
Zhang , I 2 OO 8 I and Lemma 7.2 from Maathuis and Colombol 


2015t see Section[T]of the supplement). 


Let S be the first node on d after Q. If S' is not a node on 
p, then following the proof of Lemma 2 from Zhang! (l2006h 
there is a path p' = p(X, W) © 1Y»->-Sg-»P © p{V, Y), 
where W and V are nodes distinct from Q on p{X, Q) and 
p{Q,Y) respectively and p' is m-connecting given Z in Ad. 
Since p' is non-causal and shorter than p, or as long as p but 
with a shorter distance-from-Z than p, the path p' must be 
non-proper, i.e. S G X. But, in that case the path (S, V) © 
p{V, Y) is a proper non-causal m-connecting path from X 
to Y given Z that is shorter than p in Ad. This contradicts 
our assumption about p. 


If S is a node on p, then it lies either on p{X, Q) or 
p{Q, Y). Assume without loss of generality that S is on 
p(Q, Y ). Following the proof of Lemma 2 from IZhang 
(l20()6l) . there exists a path p' = p{X,W) © © 

p{S,Y) in Ai, where IP is a node on p{X,Q) distinct 
from Q that is m-connecting given Z in Ad. Since p' 
is proper, and shorter than p, or as long as p but with 
a shorter distance-from-Z than p, the path p' must be 
causal in Ad. Let p'* denote the corresponding path con¬ 
stituted by the same sequence of variables in V. Then 
p'* is a possibly causal path and Z G PossDe(S', V), so 
Z G F'p(X, Y) n Z. This is in contradiction with our as¬ 
sumption of Z satisfying condition (1) of the GAC relative 
to (X,Y) inV. 


Thus, the path d* is directed and Q is an ancestor of Z in 
V. This proves that p* is a proper definite status non-causal 
path from X to Y that is m-connecting given Z in Ad. 


□ 





































6 DISCUSSION 


We have derived a generalized adjustment criterion that is 
necessary and sufficient for adjustment in DAGs, MAGs, 
CPDAGs and PAGs. Our criterion unifies existing crite¬ 
ria for DAGs and MAGs, and provides a new result for 
CPDAGs and PAGs, where only a sufficient criterion ex¬ 
isted until now. This is relevant in practice, in particular 
in combination with algorithms that can learn CPDAGs or 
PAGs from observational data. 


Our generalized adjustment criterion is stated in terms of 
paths that need to be blocked, which is intuitively ap¬ 
pealing. A logical next step for future research would 
be to transform our criterion into an algorithmically con¬ 
structive version that could be used to efficiently perform 
tasks like enumeration of all minimal adjustment sets for a 
given grap h. This has already been do ne for DAGs and 
MAGs by Ivan der Zander et ^ (l2014l) . and we strongly 
suspect that their results can be extended to CPDAGs 
and PAGs as well. In a similar spirit, it would be desir¬ 
able to have an easily checkable condition to determine 
if there exists any adjustment set at all, as done for the 
generalized back-door criterio n for single interventions by 
Maathuis and Colombol l ioT^ . In turn, these results could 


then be used to characterize d i stance s between graphs, as 
done bv IPeters and Biihlmannl (l20I5h . Future work might 
also explore under which circumstances our restriction to 
not allow for latent selection variables might be relaxed, or 
whether our criterion could be combined with meth ods to 
recover from selection bias dBareinboim et al. . 2014 ). 


As pointed out in Section]?] our criterion sometimes has 
to interpret PAGs or MAGs differently than DAGs or 
CPDAGs. This is the case precisely when the first edge on 
some proper possibly causal path in a MAG or PAG is not 
visible. However, this difference in interpretation is irrele¬ 
vant for DAGs or CPDAGs that would be amenable when 
viewed as a MAG or PAG. For instance, if we are given a 
DAG V that is amenable when interpreted as a MAG M., 
then its adjustment sets also work for every DAG that the 
MAG M represents, many of which could contain latent 
confounding variables. Reading a DAG as a MAG (or a 
CPDAG as a PAG) can thus allow computing adjustment 
sets that are to some extent invariant to confounding. 


There is also an interesting connection between amenabil¬ 
ity and instrumental variables; a MAG or PAG Q with 
X = {X} is amenable with respect to (X, Y) whenever 
it contains an instrument I, i.e. there exists a variable that 
is a parent of X but not a parent of any child of X (e.g., I in 
Figure]!^). Thus, instruments are useful to find adjustment 
sets in nonparametric graphical models that allow for latent 
confounding. This connection is perhaps surprising given 
that the notion of instruments o riginates from causa l effect 
identifications in linear models ( Angrist et al. . 1996h . 


In summary, our generalized adjustment criterion exhaus¬ 
tively characterizes the options to identify total causal ef¬ 
fects by covariate adjustment in DAGs, MAGs, CPDAGs, 


or less powerful ones dPear 

^ -- ^ -, 

1993; Shpitseretal 

201C 

Textor and Liskiewiczl |201 1 

van der Zander et al.. 

2014 

Maathuis and Colombo! 2015) as special cases. 
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